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Abstract: Congestion on the Internet is an old problem but still a subject of intensive research. The 
TCP protocol with its AIMD (Additive Increase and Multiplicative Decrease) behavior hides very challenging 
problems; one of them is to understand the interaction between a large number of users with delayed feedback. 

This article will focus on two modeling issues of TCP which appeared to be important to tackle concrete 
scenarios when implementing the model proposed in [7]; firstly the modeling of the maximum TCP window size: 
this maximum can be reached quickly in many practical cases; secondly the delay structure: the usual Little-like 
formula behaves really poorly when queuing delays are variable, and may change dramatically the evolution 
of the predicted queue size, which makes it useless to study drop-tail or RED (Random Early Detection) 
mechanisms. 

Within proposed TCP modeling improvements, we are enabled to look at a concrete example where RED 
should be used in FIFO routers instead of letting the default drop-tail happen. We study mathematically fixed 
points of the window size distribution and local stability of RED. An interesting case is when RED operates at 
the limit when the congestion starts, it avoids unwanted loss of bandwidth and delay variations. 
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A simple stability condition for RED using TCP mean field modeling 



Resume : Le controle de congestion dans Internet est depuis longtemps le sujet de recherches poussees. 
Le protocole TCP avec son comportement AIMD (pour accroissements lineaire, decroissance multiplicative en 
anglais) cache des problemes excessivement compliques. L'un d'entre eux est de comprendre l'interaction entre 
de nombreux utilisateurs avec un delai de reponse du systeme. 

Ce rapport va se focaliser sur deux points dans la modelisation de TCP. Ces points sont apparus important 
lorsque nous avons voulu confronter a des scenarii concrets le modele propose dans [7]; Tout d'abord la 
modelisation de la fenetre maximale de TCP: cette valeur peur etre atteinte tres facilement dans la pratique; 
ensuite, la structure des delais: la formule type Little habituellement employee donne des resultats loin de la 
realite quand les delais sont variables. Cette hypothese de modelisation a un impact important sur la taille 
predite de la file d'attente ce qui rend vaines les tentatives de comparaison entre les mecanismes drop-tail et 
RED. 

Grace a ces ameliorations apportees au modele, nous sommes capables dans un cas precis d'etudier comment 
parametrer RED dans des routeurs FIFO pour qu'il ameliore les performances par rapport au cas par defaut 
de drop-tail. Nous etudions mathematiquement les points fixes et la distribution de la taille des fenetres et la 
stabilite locale de RED. Un cas interessant est quand RED se trouve dans son domaine de fonctionnement au 
debut de la congestion, il evite une mauvaise utilisation de la bande passante et des variations dans les delais. 



Mots-cles : TCP, AQM, drop-tail, RED, congestion 
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1 Introduction 

1.1 TCP router control issue 

TCP achieves a distributed congestion control of the Internet. This article proves a usable closed formula RED 
stability. RED (Random Early Detection) was introduced by Floyd in [14]; it is to be deployed at a router to 
send congestion information to TCP Reno end users. The idea behind RED is that the first sign of congestion 
is when the router queue starts to be used more than to buffer normal traffic fluctuations; then the buffer gets 
full and the drop-tail mechanism destroys packets arriving without any room left in the queue to fit in. 

Drop-tail leads to two issues: firstly, the queue size oscillations provoke delay jitters - this has detrimental 
effects for applications using TCP for realtime content - secondly, drop-tail synchronizes sources, resulting in 
bandwidth under- utilization of the congested link (this idea was first introduced in [41] for TCP Tahoe). This 
serves as leverage because at the time bandwidth demand reaches capacity, the goodput diminishes by the 
synchronization effect, worsening the starting congestion (this assertion will be explained clearly later). 

These two main reasons explain the interest for RED and other AQM (Active Queue Management) to deal 
with TCP congestion at the router level. RED often works in an admirable way, leading to reduced queuing 
delays, avoiding jitters and reaching optimal bandwidth utilization... but sometimes RED performs worse than 
doing nothing at all (drop-tail). This is the reason why many system administrators are reluctant to use RED 
although it is deployed in almost every router of the Internet. This paper will show how to tune RED in a way 
it is sometimes optimal and always better than drop-tail. 

1.2 Our previous works and motivation 

In [7,29] and [40] we investigated mean field TCP modeling by continuing the fluid TCP model introduced and 
studied in [15,25,31]. Despite the interesting results arising from the models, there were still some difficulties in 
understanding the original problem of tuning RED and comparing it accurately to drop-tail. Two points needed 
to be addressed. Firstly, with the development of high speed access, it becomes difficult to suppose that TCP 
always works within its congestion avoidance mode in a AIMD manner. The size of packets of the order of lkB 
makes the maximum TCP window size relatively small (most common packet size is around lAkB). We shall 
say Wraax = 64 packets even if the receiver does not impose any reception window limitation. This fact is due 
to the coding of the window size on 16 bits addressing window by Bytes (2 16 B = QAkB). 

Secondly, another limiting modeling assumption is a fact noticed by Hong in [16]: when the queue is not 
empty, acknowledgements arrive obviously at the congested router bandwidth. This remark is crucial because 
TCP dynamic is very sensitive to the delayed feedback. 

1.3 Outline 

Section explains and defines our model; then we study the steady state window distribution with a maximal 
window size in section Next step consists in seeking a stability region for the RED algorithm, which is done 
in section Q] We finish with showing simulation results on a concrete example in sectional This last example 
shows how to use previous results to configure RED in a router in order to avoid collapse at the early stages of 
congestion. 

1.4 New results 

Whereas modeling W max and ACK bandwidth are not new ideas (see [34] and [16]), adapting them in mean 
field equations to obtain accurate evolution equations together with the window distribution constitutes a step 
forward. The steady state solution for the window distribution taking into account the W max phenomenon in 
sectionals an extension of [7] which is important from a practical point of view. In section^) the stability result 
obtained for RED with theoremQJis a very simple closed formula. Finally the example in section explains how 
RED should be tuned to increase router efficiency, this is an important result because, as we said, the suggested 
tuning can be applied without any hardware modification in almost every router by enabling RED. 

2 Model and equations 

A number N, relatively large, of users share a common bottleneck router (figure to see the modeled net- 
work topology). We can consider the histogram of users' congestion window sizes; in [29], we saw that this 
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histogram converges "gently" to a deterministic window distribution as N tends to infinity. In [7], we studied 
this asymptotic distribution which satisfies a partial differential equation the results were applicable even for 
small numbers (N — 25 for RED, and N = 10 for a drop-tail). Hereinafter we adapt the partial differential 
equations first presented in [7]; this is done in a way we could prove the mean field limit as we did in [29], but 
we shall not enter in such developments in this article. 



User 1, Wi 



User 2, W2 



User N, Wn 



100 Mb/s, 
ms 



Queue, 235 packets 




lGb/s, 
4 ms 



TCP sink 



Figure 1: Modeled topology and simulated scenario, N is variable. 



2.1 Evolution of congestion window sizes 

Imagine users have a notification of losses of the form n(t) in proportion of the incoming acknowledgement flow. 
Denote by A(t) a function indicating the flow evolution of windows (for example A(t) = rt ]^ in usual TCP 
models). The question of delays and how the functions A and k evolve come later in the article; these questions 
are not relevant to study the intrinsic user congestion window size evolution. Then, the distribution of window 
sizes is of the form 

D(t, w) = p(t, w)dw + M (t)5 Wmax ■ 
Which leads to two equations, the PDE: 



1 *(*,«,) + £(*,«,) = (1) 



A(t) dt v ' ' dw 
n(t) (4wp(t, 2w)xw<w ma!C /2 ~ wp{t, w)xw<w ma:c ) 
+ Sw^ 1L M(t)K(t)W max 

and 

j^-^(t)=p(t,W max )-M(t)K(t)W max - (2) 

Intuitively, the coefficient A(t) is the incoming bandwidth; when it is small, the window sizes have a slow 
reaction, when it is large, they react in a faster way. The coefficients Xw<w mam only indicate that the window 
size cannot be larger than W max . When no loss occurs, the coefficient ^(t,w) indicates that the window size 
increases linearly. When losses arise, the coefficient K enables —Kw(p(w)), which means that a certain proportion 
of users that were at window w change to another value of the window size; it also enables 4Kivp(t, 2w), which 
means that users that were at window 2w and 2w + 1 (or 2w — 1) move to window w. 



2.2 Delay in the system 
2.2.1 Limits of Little-like formula 

As noticed in [16] and in [40], the Little-like approximation made in usual TCP models (for example [7,15,35]) 
lacks realism and strongly limits the way models can explain reality. This approximation consists in saying 
that at time t, the bandwidth B(t) of a user is related to the RTT, R(t) (Round- Trip Time) and its congestion 



RR n° 0123456789 



6 



Julien Reynier 



window size W(t) and by B(t) = W(t)/R(t). If the RTT is almost constant (for instance close to the propagation 
delay), it is a rather acceptable simplification, whereas when R(t) is variable, the model can lead to unacceptable 
consequences: it is easy to understand that when one wants to study the stability of RED (with a non empty 
queue), saying B(t) — W(t)/R(t) or B(t) is constant entails different conclusions. 

2.2.2 How to improve delay model 

The idea comes from [8] where a simple delay line is introduced to study the limit behavior (when the bandwidth 
tends to infinity) of one user implementing MulTCP or scalable TCP ( [12,20,21]). Although the use of a delay 
line complicates equations, the model is still easy to simulate. Furthermore local stability of fixed points can 
be studied mathematically. Here we will adapt delay line modeling to large number of TCP Reno users. 

2.2.3 Delay equations 

Delay and queue size Let us introduce Q(t) the queue size mesured in seconds, the router is supposed FIFO 
(in other words, Q{t) is the queuing delay). Denote by K(t), the destruction probability for a packet entering 
the queue; call Bi(t) the incoming bandwidth to the queue and B Q (t) the outgoing bandwidth, scaled by the 
number of users. C is the router capacity per user. Then: 

B ( min(C,B i (t))XQ(t) = 

°^ ' \ C else. ^ ' 

RTT Call R(t) = T + Q(r(t)), the RTT virtually written by the queue on packet arriving the router r(t). 
This packet becomes an ACK that generates new packets where the value is copied. By definition we say that 
this value comes back at the router router at time t, Which makes t = r(t) + R(t) leading to the relation: 

R{t)=T + Q(t- R(t)). (4) 

We discussed in [7] the fact that this implicit equation can also be written: R(t + T + Q{t)) = T + Q(t), which 
does not raise any definition issues and is easier for numerical computations. 

Advance A(t) of window sizes The window size approximately increases by one every W(t) arrived packets. 
The incoming bandwidth for a given user is the probability that the packet is one of his multiplied by the total 
bandwidth: 

Ef w,(t - mm 

In fact we want to compute the advance of window size, which means that destroyed non-arriving packets 
carry information. Thus the modified ACK bandwidth is: 

1 W 3 (t-R(t)) 

l-K(t-R(t)) E f Wi (t-R(t)) ° [ h 

The factor of advance we shall use in window sizes evolution equations is: 

^ = l-K(l-R(t))F(t-R(t)) B ^ - n (5) 
where F(t) — jj = J wp(t,w)dw represents the number of packets on-the-flight. 

Loss rate indicator It is given by: 

K {t) = K(t-R(t)). (6) 

2.2.4 Bandwidth evolution when crossing the receiving user 

To go full circle 1 we need to say what is the value of the bandwidth Bi (t) knowing the window size evolutions 
and the ACK bandwidth B Q {t). The evolution of this number only comes from new packets being sent or ACK 
being received (counting as ACK the indication of a lost packet); thus: 

f(') = B,( 1 )- 1 _ g(t 1 _ fl(t)) B.( t -T). (7) 



X AKA: give the last equation 
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2.2.5 Generation of losses 

We will suppose that losses are generated by some AQM (Active Queue Management), or by letting the drop-tail 
mechanism work. The equations are (recall that Q is given is seconds): 

^ = B t (t)(l-K(t))-B (t) (8) 



and K{t) = f{Bi,Q), 

W-B 
S,(t) 



In the drop-tail case, for example, K(t) = Bi ^g XQ{t)=Q ma:c ■ For RED, with a loss function f(Q) and 



an averaging coefficient w q , K{t) = /(A /_ e x{s -^Q(s)ds) with A = -iVSjlog(l - w q ). Hereinafter we shall 
suppose the router uses RED with A very large which means that 

K{t) = f(Q(t)). (9) 

To achieve this we shall say in the following that w q = 1, but a weaker assumption is that A >> 1, which would 
allow us to set w q as so to admit bursts of packets without any losses when the total bandwidth is relatively 
large (see [14]). 

The model is completely specified by equations Q1I2I3I5I6I7I8I9I) and wc can now analyse stability. We first 
find fixed points with a constant loss indication, then we study the stability of these points supposing the use 
of a RED mechanism. 

3 Fixed point 

To study fixed points it is sufficient to study fixed points for window sizes. Other conditions follow immediately 
in paragraph |231 

3.1 Fixed point equations 

Eliminating t in equations (QJ and @ leads us to consider a distribution of window sizes of the form: 

D(w) — p(w)dw + MSw max , 

with the two equations: 

p'(w) = 4kwp(2w)xw<w max /2 ~ kw P( w )Xw<w ma:c (10) 
+MkW max 8 w max , 

2 

and 

MkW max =p{W max ). (11) 
The graphical representation of the solution found by MAPLE can be seen on figure [2 

3.2 Fixed point resolution 

Theorem 1 The solutions of the system \lf\ll)) on [ , w % l n am ] are: 



i=0 



where for n> 1 : 



i 4 n i kw ™ 

< = «o + MkW max e" — 

1L=i ( 4 - 1) 



and if i > 0: a" = j=p — a o ^ ne an ^ on ^V one solution is positive with integral 1. 

Given the formula, the theorem is simply verified by replacing the candidate solution in the equations. To 
see the intuition behind let us see the first two iterations. 
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W 



Figure 2: p(w) for W max — 64, with the constant drop probability 0.15%, the mass at W max is approximately 
3.3%. 

3.2.1 First iteration 

Forwe [^,W max ], 

p'(w) = —kwp(w), 

kw 2 kW 2 

thus p(w) = a$e~^~ . The limit condition says that p(W max ) = a®e 9^ — MkW max which entails: 

al = MkW max e 2 . 

3.2.2 Second iteration 

For^ePf-^L 

p'(w) — —kwp(w) + 4fcwaQe -4- ^. 
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By standard techniques we find the solution p(w) — a\e ™ — |a[]e 4 2 ; which means that a\ := — |<Zq. The 
limit condition determines a 1 , by saying that p ^ vy " 2 '° I = p ( w ™ ax J + MkW max , ie: 



giving the good value to take for aj: a\ := a® + MkW max jhj e 4 
3.3 Normalization 

3.3.1 Regularity properties 

We know a priori that for a well chosen value of the parameter M, D is a probability. In this section, we show 
a little more by saying that the density part p is continuous and has a limit at w = 0. 
Notice that by integrating the EDO (1 1 01) on [w, W max + 1] we obtain 

p(w) - 0= / kvD(v). (12) 

We saw the solution on , W ma x] which in particular is positive, thus p cannot reach for positive values 

of w (we already knew this by the fact that D has to be a probability). By construction p is continuous on 
(0, w ™ am ) , thus bounded on compact sets. Look at the explicit form of p we have just calculated. It is always 
series bounded by: 



< MW max J2 



2 



And the last series is convergent because its general term is equivalent to 4~ 4 ^~ 3 ^/ 2 . 
Now use the boundedness of p in the equation 1121 when w is close to 0: 

\p(w)\ = / kvp(v) < (2w) 2 k sup \p\ ; 

J w 

thus we see that p tends towards at w = 0. 
3.3.2 Computation of the integral 



rWmax 00 rW ma x/2 n n 2 

/ D(v) = M + J2 E a " e " 4 *^^ 



00 n 



, 2fc4 4 

n=0 i=0 



/ 4 J_n \ 

We already noticed that p(w) > 0, then it is not surprising that the previous sum converges very quickly by 

the conjugated effects of p being small and the size of the integration domain tending exponentially to 0. Then 
a very interesting result from the practical point of view is the proportion of users at W max function of the loss 
rate which is shown in figure A first order Taylor development of log(M) in k is immediate and: 

Theorem 2 

log(M) ~ -\kWl ax . 

For instance see that the case of figure El gives approximately M e -o.i5%x£§- _ 4 instead of the 
exact value 3.3%. This is not a very accurate approximation, but it gives a good order of magnitude for a first 
approach, still, it is easy to compute very good numerical values (see figure QJ. 
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Figure 3: log(M) (the mass at W max ) function of the drop probability k for W max = 64. 
3.4 Equilibrium values 

Denote by {Bf, B%, K e , Q e , R e ) a set of equilibrium values with a non empty queue, then from equations 
and 

B% = C 

i? e = T + Q e (13) 
K e = F(Bi, Q e ). 

At an equilibrium, the Little-like formula works (because the delay R e is constant) and: 

F e 

Bt = Te . (14) 
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Then the conservation equation Q (or equation (§J) and the advance equation © give: 

m = ^ (15) 

^ = i- (16) 

This last equation shows us that in a steady state the function of advance A is the one that usually appears 
in TCP models. We see that the equilibrium relations are of the same kind as those in [7]. 

An equivalent to the square root formula would be needed; obviously when K e is not too small the square 
root formula still applies (the W max limitation is negligible), but when it starts to increase, the limitation on 
the window size lowers the mean window size. Figure illustrates this result; we can see there that the square 
root formula is almost exact for K e > 0.15% 



RR n° 0123456789 



12 



Julien Reynier 




4 Stability analysis 
4.1 A first remark 

Recall that F(t) is the number of packets on the flight and let us call i*2(t) := w 2 D(w) the second moment of 
the probability D. Denote by a dot (□) the derivative of function □ with respect to the time t. Then combining 
equations (/ wx (QJ dw + W max J5J), we have: 

1 -F(t) = 1 - M(t) - \-K(t - R(t))F 2 (t), (17) 



A(t) w w 2 
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which leads to the equilibrium equation: 

I — M 

F% = 2 1 ——. (18) 



K 



4.2 Stability equations 



We study the stability of the fixed point (Bf , B%, K e , Q e ). We intend to study an equilibrium with a non-empty 
queue, this implies B a (t) — B e Q — C ' . 

The idea is to add a small perturbation of the form Aw on the window sizes at t — a time at which a 
fixed point has been reached. To simplify we suppose that the response is uniform and we denote it by Aw(t); 
the variations are truncated at the first order. This simplifications entails that the variation of the on the flight 
packets number F(t) = J2 W ^( W ) ls Ai^t) = Aw(t). 

The assumption on Aw permits to write Ai^(t) = 2F e AF; then taking the variation at first order in 
equation II17J1 gives: 

AF(t) — / A e AM{t) — A e K e F e AF 

AF{t > - { -±A e FiAK(t - i? e ). (19) 
This equation comes with the linearized version of 0: 

A MM - I A6A K*> W max ) - A-K e W max AM(t) 

[ ) ~ \ -A-M-W max AK{t - R*). ^ 



From equation @: 



Equation <jHJ leads to: 



AF(t) = AB.it) _° AK(t - W). (21) 



CAQ(t) = (1 - K e )ABi(t) - BfAK(t). (22) 
Finally the instantaneous RED control gives: 

AK{t) = eAQ{t). (23) 

where e is the slope of the RED control function at the equilibrium point Q e . We suppose here that the averaging 
factor w q is equal to 1 (which means that we only consider the instantaneous value of the queue to compute 
losses). 

Notice that the variations of RTT only create second order terms; this is the reason why equation iQJ does 
not have to be used. This is the same for equation because the fist term factors are always multiplied by 
the second term of the equations that have null equilibrium values. 

4.3 Differential equations with time delay 

The equations iflfljl . (j2f)l) . J2lj, il22ll . l(23jl can be reorganized as a system of three delay differential equations 
on F, Q and M. The only problem is that Ap(W max ,t) is not well determined in the equation ltl9|l . We shall 
make the further simplifying assumption that the term A e Ap(t,W max ) can be replaced by p e (W max )Aw = 
P e {W max )AF(t), which is intuitive since when all windows are increased by Aw, the additional number of users 
overtaking window W max is close to the announced number if we say that the window distribution stays close 
to the equilibrium one at the first order. 

The standard mathematical method to find the local stability condition with a linear delay differential 
equation is the following (see [9]; this is an equivalent to the Bode diagrams approach). Find a solution of the 
form A^e^ 4 where Aj and (j) are complex numbers, linearize the exponential factor coming from the delayed terms 
which corresponds to the replacement of terms AX(t — r) by (1 — reft) AX (t). We are led to one more unknown 
than equations, hopefully one of the A-s may be replaced by the value 1. Then solve the polynomial system to 
find (j). A necessary condition for the system to be stable is that the real parts of all solutions are negative. 
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Let us look for a solution of the form (AF, AM, AQ) = (e**, xe^, ye^). This leads us to (the equations are 
divided by e^ 4 and the expressions e^ R are linearized for 4>R e « 1): 

e (\ _ M e \ 

R e 4> = -K e F e -x- -i— - >-(l-R e (j))y (24) 

K 

R e <f>x = R e M e K e W max - K e W max x 

-M e W max e{l - R e <f>)y (25) 

1 - 

From lj2fil) and replacing using ijHj) and ltl5|l 



C^y = J^jlfc,. (26) 



(7 + eF e 

Then from using iflfiji : 



1 - JT e 

W = (27) 



-g = i? e + if e F e + (1 T M) \ l - R e ^)y- (28) 



Putting everything in <25li ; 



(i? e + iTW^) (-a) + # e M e X e W m(M 

-M e W mQX e(l - i? e 0)y = 0. (29) 

This is a second degree equation of the form: 

a<j) 2 + b(j> + c = 0, (30) 

with: 

a := (^(i-i^ej,) 
b := i? e [K e F e + f (1 - M") + K e W max 
-K e W max %(l - M e ) + ikTWW,] 
c := not needed. 

Lemma 1 The following properties are fulfilled: 

• a,b and c are real numbers, 

• Suppose a > 0, then both solutions have negative real values if and only ifb>0. 

Proof: The first point is true by definition. The second point is easy: if 4>\ and 02 are the roots of 1-iOII . then 
01 + 02 = — -• The coefficients are real, which ensures that 02 = 0i (the conjugated complex number), thus 
0i + 02 = 27?.e(0i), which grants our point. □ 

Theorem 3 A sufficient condition for RED with w q = 1 to be stable is: 

K e C , N 

e < i^W ^ 

Proof: First look at b; let U = - M e ) > 0, then: ± > if and only if: 

K e F e + U + KW max (l -U) + M e W max > 0. 

A sufficient condition is that U < 1 but: e < =^ e(l-M e )(l-K) < K e C, which implies e(l- M e ){l-K) < 

K e C + K e eF e , ie: U < 1. 

(j;°)2 = 1 — t/ > like we have just seen. Then Lemma ^ applies and gives the conclusion. 

To finish the proof, we need to say something about the assumption 4>R e << 1. An acceptable condition 
would be that j^R e « 1 (we only check the that the real part is small): 

h -w 1 - u 



2a K E F e + U + W max K e (l-U)+M e W„ 
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We see there that the RTT does not play an important role; this quantity is small if either K e W max or M e W max 
is large, which means that is is always a good approximation. □ 
Remark that from the proof a weaker stability condition for RED with w q = 1 is 

1 - K e e 

T ^ „ —= (1 - M e ) = U < 1. 

Corollary 1 IfW max — oo (the approximation made in [7]), then the stability condition for RED with w q = 1 
becomes in all the usual conditions 2 : 

KC 

e < = . 

1 - aVK - K 

with a ~ 1.310. 

Proof: The proof would be the same without the second equation on M. In that case we found in [7] the exact 
formula: F e = (this is one example of the well-known TCP square root formula). All this directly leads to: 

1 - K e e 

The conclusion is only a reorganization of this equation. □ 
We see that that the condition e < RC is a rule of the thumb valid in every case. The last corollary will be 
named theorem because it is the most important result of the article from a technical point of view. 

Theorem 4 A universal stability condition for RED is: 

a 2 

€ < 



(T + Q max: 

)W„ 



where a 2 « 1.7. For parameters T, Q max = Max t h = /3T, Minth = jT, and p n 
if: 

P+l W m ax 

Proof: Recall theorem |2] says that M e ~ e~% K w ma X when K e is close to 0, then: e < is a stability 

condition; which entails the result for K e close to 0, using the fact that the capacity for a user at the window 

W max is exactly w ™* . For other values, the square root formula implies that > ^i^K)(R"y i c > bW • 

To conclude, add the fact that R e < T + Qmax and the definition of RED. □ 



5 Simulation results 

The example we shall study is inspired by a real Internet provider configuration, it is illustrated by figure^] the 
mean field simulator can be downloaded at [39] . On a one giga-bit router in some part of the network the total 
propagation delay for end users is 10ms. The router is configured with a 2ms FIFO buffer (which is five time 
less than the usual delay bandwidth product rule). The faced problem is a jitter felt by end users. The size of 
packets is supposed to be IkB — 8192 bits and we shall say that the level 2 overhead is 40B; then the maximum 
congestion window size which is 64kB corresponds to 64 packets; the router capacity is 1.17e+5 packets per 
second and the buffer size corresponds to 235 packets. We also suppose that end users have a limited capacity 
at their access so that the packets do not arrive in bursts at the router (which is an assumption of our loss 
model); let us say that the limit is lQOMbits/s (and the buffer size at the access is unlimited). 
In [40], we saw that 10 users can be considered to be a large number for the drop-tail. When sources are less 
synchronized, the mean field simulations are always accurate for 25 or more users. We can see this on figure 
that the NS simulation of TCP Reno works close to our model which means that there are few timeouts and 
slow starts and that AIMD is a good model for fast recovery/fast retransmit. 

2 for K < 54% which is a lot larger than the limits tolerated by TCP that turn around 8% 
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Figure 6: Normalized queue size for 35 users with lkB packets on 1Gb/ s link and drop-tail policy (time is in 
seconds, propagation delay 10 ms, maximum queuing delay 2ms). NS-2 is dotted and mean field equations give 
the solid line. 

5.1 Results with drop-tail 

5.1.1 Before congestion happens 

As can be seen in figured for less than 19 users, the router capacity cannot be reached and the total throughput 
per user stays at w ™ a * *1024*8frzts « 52Mbits/s which is the maximum possible with the considered propagation 
delay and packet size TCP can allow. We see that from 20 to 22 users, the queue increases steadily from to 
its maximal value, so the RTT increases from 10ms to 12ms. Remark that a stable queue close to its maximal 
value is something that should be avoided because it leaves too little room for fluctuations to be smoothed. For 
23 and 24 users, the queue starts oscillating, but the bandwidth still stays around its maximum. 

5.1.2 The early congestion phase 

From 25 users, both NS-2 and the mean field equations show an extremely bad behavior: the utilization drops 
to 99% for NS-2. Then utilization drops to a worst utilization of 96% around 40 users, this can be explained 
by an increasing synchronization between users. 

5.1.3 Strong congestion phase 

For more than 50 users, the utilization starts to increase because the mean window size decreases: although 
the synchronization level is very high, with a small window, the additive increase mechanism goes back to a 
maximal utilization quicker than with a larger window which explains the link utilization improvement. 
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Number of users 



Figure 7: Comparison between RED and Drop Tail for NS-2 and our mean field equations. The network model 
is the one of figure JQ; RED parameters are those of paragraph 15, 2 .11 



5.2 Results with RED 

5.2.1 Configuration 

Suppose that Max t ^ = Q max = T/5 and that Mint/, = 0.2 Max t h = -4ms = 47 packets. The rule of the 
thumb of theorem 01 gives a value of .36%, which gives an insight of the value to take. We saw with NS and by 
simulating the mean field equations that .5% was also a working value whereas .75% was too high to achieve a 
stabilization in every case (but leads to small oscillations), which explains our choice: p max '■= -5%. 

Nothing changes for less than 20 users because the queue size stays below the minimum threshold of RED. 

5.2.2 RED in its working regime 

From 20 to 80 users, RED permits to have a steady state with a queue size going to its maximal value. From 
21 to 70 users the queue size goes from to 1ms, the second half of the queue size is the stabilization region 
between 71 and 80 users. This can be explained roughly by the square-root formula: the steady state value of 
the loss rate is proportional to (TC)~ 2 , when C diminishes, the loss rate increases quadratically and so does 
the queue size. This fact would advocate for an exponential shape of the loss rate function as indicated in [40]. 

5.2.3 RED working like a drop-tail 

Then, in the case 81, the simulation noise in NS makes the queue size touch the border and begin a drop-tail 
like behavior. So does the mean field simulator for 85 users. Then RED behaves like an improved version 
of a drop-tail (for less than 130 users the queue never empties) . Overall figure shows that the bandwidth 
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utilization always stays beyond 99.5%. In this state RED behaves better than drop-tail from the bandwidth 
utilization point of view, but there is an oscillation which makes it a good choice to take a small queue. 

5.3 Increasing the latency 

When one increases the latency, the relative value of Q max decreases, meaning that even with the same syn- 
chronization, the buffer does not provide the same bandwidth insurance. Another effect has to be taken care 
of: when the latency increases, the maximal bandwidth decreases, which means that more users are needed 
to reach the router capacity. When the latency is increased, the worst case for drop-tail is still at the early 
congestion stage because window sizes are huge. We saw that our RED configuration, even when not working 
in the steady state domain, gives better results in terms of link utilization. 

5.4 Mixing latencies 

When latencies are mixed, as was previously observed in the literature, the equivalent latency is the harmonic 
mean of latencies, meaning that small latencies are preponderant in the configuration of a router. This fact is 
intuitive because the small latency connections adapt to bandwidth changes quicker, and if they are stabilized 
by the controller, the set of other connections act exactly like one constant bitrate user (even if each one of 
those connections sometimes divides its bandwidth by a factor 2). We also observed that when RED was not 
acting in its steady state area, our RED configuration never acted in a worst way than drop-tail, which is due 
to the fact that p m ax is not too large. The case where RED would be worse than drop-tail would be for a too 
large value of p m ax where RED acts like a drop-tail at Mirith which means that a part of the buffer is never 
used. 

6 Conclusion 

We saw how to model accurately TCP and how to give an easy closed formula to tune RED. This lead us to 
observe a bad news about the drop-tail: the worst case for bandwidth utilization for a drop-tail is just after the 
congestion is reached. This is illustrated in our example. We saw there how to use our framework to configure 
properly RED to obtain a situation where the congestion can be supported without any loss of bandwidth for 
a very long time and without any delay oscillations. Then for extreme values, our configuration behaves not 
worse than drop-tail which is a good reason to use RED in a router. In an actual router users have multiple 
latencies, we also said briefly that if a sufficient number of low latency connections are present, then RED leads 
to a steady state. 

7 Related Works 
7.1 TCP modeling area 

The problem of ./V connections sharing one bottleneck router has been extensively studied in past years. The 
first models were made by Ott and Al. in [11,27,30,32,33]. Then some interesting studies belong to May, Bonald 
and Bolot in [28] and Vinnicombe in [42], but it appeared we owe the most promising approaches to Kelly and 
Al. [19,26] with a utility maximization problem and to Gong, Hollot, Misra and Towsley in [15,25,31] with the 
idea of introducing a fluid equation supposed to model the aggregated behavior of many TCP sources. This last 
approach motivated mathematical study of the mean field interaction to obtain accurate intrinsic equations of 
what TCP is; namely it was the study of the AIMD TCP Reno behavior (congestion avoidance [18]). 

The main works in the area are those by Tinnakornsrisuphap and Makowski [34-36] with a discrete time 
model simple yet very efficient; Srikant and Al. [13,23] with discrete time where TCP users have to compete 
against a white noise; Baccelli, Hong and Al. [3-5, 10, 17] with stochastic time steps, no buffer but an optional 
HTTP adaptation [2]; and Baccelli, McDonald and Reynier [7,29] which is the model we adapted in this article. 

We believe our model is the most efficient because we were able to use continuous times which really matters 
due to the strong dependence of the problem on delay; our model explicitly uses the TCP mechanism and we 
were able to deal with boundary effects which made it possible to study both RED (or other AQM mechanisms) 
and the drop-tail. We were also able to take into account heterogeneous sources (see [29]). This article permits 
to see one other advantage of our approach, it is easily adaptable to changes in the TCP dynamic or in the way 



INRIA 



Stability condition for RED 



19 



TCP is modeled; for example in [40], we saw how to adapt it to intermittent TCP sources (to model HTTP 
users behavior). 

7.2 Control theory applied to TCP 

Another branch of studies is the control theoretic approach used in [15] we adapted here to find stability 
conditions for the time delayed equations we dealt with; for example, the same kind method was used by Kim 
and Low in [22]. The problem of these studies is that they usually rely on a little-like formula, which leads to 
poor results when trying to compare to simulations: simulations show behaviors a lot nicer than expected. Here 
we solved this issue and found a very simple closed formula that implies stability for RED (see theorem 0}, 

7.3 Buffer sizing for IP routers 

As noticed by McKeown, Wischik and Al. in [1,24,37,38,43], the kind of scaling we do in our model can 
create problems. In core routers, slowly switching from ATM to IP, very fast and expensive memory is needed, 
and bandwidth optimization is not the first goal. In that case good overall performances can be achieved by 
choosing very small buffers at the cost of a waste of bandwidth even before the congestion level is reached. We 
did not intent to study highspeed core routers in this article. We are interested in some access routers that are 
not in the provider's backbone. The bandwidth is limited and the number of links to upgrade make it difficult 
to over provision users' needs. Then, as we saw in the simulation section, RED may be a solution to avoid the 
leverage effect at the early stages of congestion. 

8 Further Works 

Understanding exactly how to tune a router to avoid early congestion effects for HTTP users is still a challenge. 
Even if the equations are relatively easy to write (see [40] or [6] for theory and the implementation in [39]), from 
a practical point of view it is difficult to obtain accurate results. This is because of a high output dependence 
on how users are modeled, and from their statistics. For instance, determining what is a "good" distribution of 
file sizes or idle times between two downloads is not an easy task. 

Another interesting task would be to obtain easy closed formulae for drop-tail metrics such as bandwidth 
utilization. 
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